
opencl: add q4_0 MoE GEMM for Adreno #22731

Merged
lhez merged 6 commits into ggml-org:master from qualcomm:sg/moe-clc-upstream-q4_0
May 8, 2026

Conversation

@shawngu-quic
Contributor

Overview

Add Q4_0 MoE OpenCL optimizations for Adreno.

Requirements

shawngu-quic requested a review from a team as a code owner on May 5, 2026, 20:47
github-actions bot added the labels ggml (changes relating to the ggml tensor library for machine learning) and OpenCL (issues specific to the OpenCL backend) on May 5, 2026
lhez changed the title from "Sg/moe clc upstream q4 0" to "opencl: add q4_0 MoE GEMM for Adreno" on May 5, 2026
Comment thread ggml/src/ggml-opencl/kernels/gemm_moe_q4_0_f32_ns.cl Outdated
Comment thread ggml/src/ggml-opencl/kernels/gemm_moe_q4_0_f32_ns.cl Outdated
lhez force-pushed the sg/moe-clc-upstream-q4_0 branch from abdbf13 to d2e57a6 on May 6, 2026, 07:05
lhez merged commit f3e8d14 into ggml-org:master on May 8, 2026
108 of 110 checks passed
cetarthoriphros pushed a commit to cetarthoriphros/llama.cpp that referenced this pull request May 9, 2026
* Q4_0 MoE CLC pass sanity check

* release program

* opencl: fix whitespace

* opencl: remove unused cl_program

* opencl: break #if block to make it more clear

* opencl: adjust format

---------

Co-authored-by: Li He <lih@qti.qualcomm.com>

Labels

ggml: changes relating to the ggml tensor library for machine learning
OpenCL: issues specific to the OpenCL backend


4 participants